A Benchmarking Framework For Proof Systems